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Attempts to understand the relationship between diversity, productivity and scale have remained limited 
due to the scheme- dependent nature of the taxonomies describing complex systems. We analyze the 
diversity of US metropolitan areas in terms of profession diversity and employment to show how this 
frequency distribution takes a universal scale-invariant form, common to all cities, in the limit of infinite 
resolution of occupational taxonomies. We show that this limit is obtained under general conditions that 
follow from the analysis of the variation of the occupational frequency across taxonomies at different 
resolutions in a way analogous to finite-size scaling in statistical physical systems. We propose a theoretical 
framework that derives the form and parameters of the limiting distribution of professions based on the 
appearance, in urban social networks, of new occupations as the result of specialization and coordination of 
labor. By deriving classification scheme-independent measures of functional diversity and modeling cities as 
social networks embedded in infrastructural space, these results show how standard economic arguments of 
division and coordination of labor can be articulated in detail in cities and provide a microscopic basis for 
explaining increasing returns to population scale observed at the level of entire metropolitan areas. 

A fundamental theme across the study of complex systems 1 " 3 -from ecosystems 4 ' 5 to human behavior 1,7 
and socioeconomic organization 3 ' 8 ' 9 - deals with the mechanisms by which diversity arises and is 
sustained. 

Diversity means generally that component agents of a large system are heterogeneous so that they exhibit 
different traits and perform different functions. For all its ubiquity and importance, the functional role of 
heterogeneity in complex systems has remained poorly understood. General lines of argument for explaining 
diversity can be roughly divided into two classes: those that emphasize its protective role in uncertain environ- 
ments and those that invoke direct gains in productivity as a result of functional heterogeneity 5,6 . 

On the one hand, heterogeneity may confer collectives with resilience to selective shocks through a hedging 
effect that allows some components of the system to survive in times of crisis 10 . For example, cities that have 
remained very specialized (in manufacturing) have historically fared poorly over the long run. This effect already 
speaks of the importance of heterogeneity in enabling differential selection and therefore speeding up "evolu- 
tion" 1112 , whether in ecosystems or in markets. Selection, in this picture, remains a function of the external 
environment and confers no immediate incentive on particular individuals to differentiate: It develops because its 
effects, in retrospect, are necessary for the group's survival. 

On the other hand, a more direct and potentially more fundamental link between diversity, productivity and 
scale may exist. This connection has often been been invoked, both in ecosystems and in economics, to explain 
why larger systems tend to be at once more diverse and more productive 3 " 5 ' 8 " 1013 . Nevertheless, the formalization 
of these ideas remains incomplete and does not yet explain quantitative patterns in observed levels of diversity 3 ' 14 . 
The key concept is that diversity is both a consequence and an enabler of scale-dependent system organization 
and that it is directly associated with higher individual productivity. As a consequence, higher heterogeneity can 
be positively selected endogenously by agents and organizations that benefit directly from such strategies. Here, 
we show how diversity, productivity and social network structure can be combined at the microscopic level to 
generate an open-ended dynamics of growth, differentiation and complexification in cities. 

In contemporary human societies socioeconomic diversity is associated primarily with large cities 7 ' 15 . This 
reflects the primary role of urban environments in promoting the production of new ideas and stimulating 
socioeconomic development 31516 . However, counter-arguments have also emerged noting that specialized cities 
are, at specific times, more productive 16 " 19 . Familiar examples are contemporary Silicon Valley or manufacturing 
cities in their heyday such as Manchester or Detroit. These questions remain far from settled, in part because of 
the difficulties inherent to measuring diversity in any complex system 2,20 . 



SCIENTIFIC REPORTS | 4:5393 | DOI: 1 0.1 038/srep05393 



1 



Empirically, diversity has remained difficult to characterize un- 
ambiguously. Most measures account for the presence, and some- 
times the relative proportion 21 " 23 , of different functional types: For 
example different professions or business types in cities or nations, or 
different species in an ecosystem 24 . Such measures of diversity, are 
inevitably linked to particular classification schemes or taxonomies. 
To appreciate this point, consider the question: How many different 
professions are there in a large city, like New York? In general, there 
is no objective answer as it depends on how finely one differentiates 
similar functions. However, under general conditions, a limit of infi- 
nite taxonomic resolution can be inferred through an analysis ana- 
logous to that of statistical physical quantities near phase 
transitions 25 : If such a limit exists, taxonomic scheme-independent 
answers emerge. 

Here we show how addressing these two issues at once - measuring 
functional diversity and linking it to productivity by grounding it in 
scale-dependent urban social structure - can help shed light on the 
dynamics and structure of cities. As a result we are able to develop a 
predictive theoretical framework for the emergence of functional 
heterogeneity as a consequence of a process of division and coordina- 
tion of functions in social networks, stimulated by productivity gains 
and made possible by increases in population, that is: system size. 

Results 

We start by showing that the simplest measure of functional diversity 
in cities, the count of the number of distinct professions, is classifica- 
tion scheme dependent. We then show how a scheme-invariant 
measure can be obtained in the limit of infinite resolution via well- 
known methods of finite size scaling in statistical physics. This allows 
us to derive the general form of the distribution of professions in US 
urban areas, which we show applies to cities of all sizes and over time. 
We then develop a theoretical framework that derives the observed 
patterns of professional diversity from the maximization of eco- 
nomic productivity under social network constraints reflecting coor- 
dination costs at the metropolitan level. 

Classification resolution and the number of distinct professions. 

The simplest measure of diversity, D(N), counts the number of dis- 
tinct professions in a city. Fig. 1A shows D(N), for US metropolitan 
areas (MSAs) as a function of their total employment, N e . Because N e 
is, on average, proportional to population 26 , N, we use the two 
measures of scale interchangeably. We observe that D increases 
with N e initially and then saturates for large cities and is well fit by 



D(N e ) = d 0 



ft) 



1 + 



ft) 



(1) 



Eq. 1 holds over time (Fig. SI) and for different levels of resolution, r, 
in the occupations hierarchical classification scheme (see Supple- 
mentary Information). The parameters in Eq. 1 are, in general, 
functions of r. The scale d 0 (r) is the effective size of the 
classification scheme at resolution r, N 0 (r) is the characteristic size 
of the city at which saturation starts. The exponent (elasticity), y, is 
empirically independent of r: It expresses the proportionality between 
the population growth rate and that for new occupations in the city, 
in the absence of saturation (see Supplementary Information). 

Coarsening the hierarchical classification leads to similar satura- 
tion at each of the taxonomy's size d 0 (r 6 ) > d 0 (r 5 ) > d 0 (r 4 ), etc 
(Fig. IB). This behavior is the hallmark of a finite resolution artifact, 
a phenomenon well understood through finite-size scaling analysis 
in statistical physical systems 25 . The explicit dependence of d 0 on r 
suggests that existing classification schemes are too coarse to capture 
the true professional diversity of large US cities 27 , beyond an employ- 
ment size of order N e ~N 0 ~10 5 . Nevertheless, we can use the vari- 
ation of the statistics of occupations with r to derive classification 



scheme independent results. We reconcile all curves for D(N) at 

different r and extract their limit as r —> °o. We define a dimensionless 
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where D 0 is a constant. Comparison with Eq. 1 tells us that in the limit 
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universal scaling regime exists if and only if the quantity D 0 — , 

No(r) 

becomes a constant, independent of r, as r — » oo (Fig. IB). Fig. 1C 
shows d 0 vs. across r and over time. The relationship is well 
described by a straight-line with slope D 0 = 0.05 across all years. 

These results suggest the existence of a resolution independent, 
scale-invariant limit for D(N) and show that the occupational divers- 
ity of cities is in fact open-ended: the number of distinct occupations 
in US cities increases by y ~ 84% with each doubling of its labor 
force, meaning that larger cities are at once more diverse in absolute 
terms and more specialized per capita. These insights can be proven 
as simple theorem, see SI Text. 

Universal Distribution of Urban Professions. Beyond considering 
the presence or absence of professions, which gives only a crude 
measure of urban diversity, we can characterize their probability. 
To derive this distribution we identify D(N) in Eq. 1 with the 

maximum rank at each N, which has probability p(D) = —. 

Inverting this relation and generalizing it to all ranks, i, leads to 
the occupational frequency, fit): 



/(0= 



N e (d a — i 
No 



(2) 



This is also scheme independent in the large resolution limit and can 
be used to derive the probability density, p(i), as 



p(i)=; 



l-y 



-1/y 



(3) 



which is also independent of r. The occupational probability has a 
residual dependence on N through D(N) because the rarest 
professions in each city cannot have less than one person. Figs. 2A, 
B shows that the distribution of occupations for different cities is 
universal: When adjusted for scale, N e , the frequency curves for every 
city collapse onto a single line, Fig. 2B. This shows that there is an 
expected nested sequence of occupations predicted by city size, as 
expected by the hierarchy principle of central place theory 28 and its 
extensions in regional economics 23 ' 29 . This sequence of professions 
arising with growing city size is also analogous to the hierarchy of 
products vs. level of economic development observed at the national 
level 8,9 and shows how large cities play a fundamental role in 
embodying the economic diversity of nations. 

Indices of diversity. The diversity associated with the distribution of 
types is commonly measured by functionals of the probability dis- 
tribution, p(i). Examples are the Herfmdahl-Hirschman Index (H H) 
and the Shannon entropy (<*) 21 > 2224 . Here we show that the residual 
dependence of p(i) on N, through D(N) in Eq. 3 is the only source of 
city size dependence of these measures of occupational diversity. 

The Herfindahl-Hirschman index measures how concentrated a 
distribution is. For this reason it is often applied to economic sectors 
to measure their concentration in terms of firm's market share and 
assess the existence of monopolies. Given the asymptotic form of the 
distribution (Eq. 3) the H H(N) can be calculated analytically as 
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Figure 1 | The number of distinct occupations in US Metropolitan Statistical Areas vs. total employment. (A) The relationship between the number of 



professions present for each city (orange dots) and city size is well described by D(N e ) = d 0 



(N e /N 0 [ 



, with d 0 = 686, y = 0.84, N 0 = 1.48 X 10 5 (blue 



'l + (N e /N 0 ) 

line). (B) D(N e ) at different labels of resolution of the occupational classification scheme, r iy with i = 6 the finest and i = 3 the coarsest. (C) d 0 is 
proportional to Nq across levels of classification scheme resolution and time, suggesting the there is a r- independent limit to the form of the occupational 
diversity of cities and that D is open-ended. In this limit, D(N e ) = D 0 Nj and larger cities are always more diverse as a whole, but more specialized per 
capita. 
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where the exponent 3 = 1 — y, Consequently the H H index 
decreases with N towards a small constant, set by the exponent 3, 
as cities grow. This expresses an increase in diversity with city size. 
Note that the asymptotic value for o°, with 3 ~ 1 /6 30 , is H H — > 
0.028, which is typical of highly diverse (and competitive) markets. 

Similarly the Shannon entropy, S, measures the diversity of the 
occupational distribution as 



D(N) 



- 3/(1-3) 



N~ 



5 ln(Dj /y iv), (5 



which increases with N towards the Pareto distribution limit at infi- 
nite N. Thus, the increase in entropy signals the increase in diversity 
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of the occupational distribution in larger cities. Note that in both 
cases the increases in diversity are driven, at leading order, by a term 
of order N~ s (Fig. S3). As we shall see below, this term measures 
professional diversity per capita and is also implicated in the growth 
of labor productivity and social connectivity with city size. 

Diversity, Productivity and Urban Social Networks. We now 

address the problem of how professional diversity, economic 
productivity and urban social networks are interconnected. 

Given the empirical results so far, we should expect economic 
productivity to be inversely proportional to professional diversity. 
To see this more explicitly, consider that urban indicators of eco- 
nomic productivity (wages, GDP, etc) scale, on average, superlinearly 
with city size N, W(N, t) = W 0 (t)N(t) p , with W 0 (t) and jB ~ 1 + 3 > 1 
independent of N 26 , see Fig. 3 A. The average wage per capita is, then, 
w(N) = W 0 N 3 , where (5~1/6^1— y 30 . This result has been derived 
from a general theoretical framework that defines cities as co -located 
social networks, subject to infrastructural efficiency constraints 30 , 
with w = Gk(N), where G is a constant in N, involving a balance 
between mobility costs and infrastructural properties, and k(N) = 
k 0 N d is the average social connectivity (network degree) per person. 
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Figure 2 | The distribution of occupations in US metropolitan areas is universal. (A) Frequency distribution for several cities with different population 
sizes only differ in their amplitude, which is set by city size and the extent to which they probe rare occupations. The horizontal grey line shows the 
minimum number of professions (thirty) reported. (B) The rank-probability distributions for different cities collapse on each other when adjusted for city 

size (total employment). The yellow line shows the fit of the universal form to f(i)/N e = ^— { ~~r. 
ranks. The black line is the f of f[i)IN e in the absence of saturation. °^ ^ 



, where we introduces a scale i 0 ~ 3 at small 
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(This increase in social connectivity with city size has been observed 
directly in urban telecommunication networks 31 .) Similarly, diversity 
per capita, d(N) = D(N)/N = DqN 7 " 1 = D 0 N~ S . Hence, we conclude 
that w oc 1/d. This relationship is an expression of the abundant 
evidence in economics for knowledge specialization - a decrease in 
d(N) - as the source of increases in (labor) productivity 19,32 " 34 . 
However, this view of productivity per se cannot be the target of 
optimization: no city has become rich by reducing its occupational 
diversity to a single activity! What then is the optimal level of divers- 
ity and productivity attainable at the city level? 

Before we answer this question we note that the connection 
between the microscopic features of socioeconomic urban organiza- 
tion and the aggregate productivity of cities has remained a fun- 
damental but difficult question in urban economics. A number of 
detailed socioeconomic mechanisms have been proposed to explain 
increases in productivity with city size 16 , all of which depend on the 
social coordination opportunities that larger cities enable 36 . A differ- 
ent, if not orthogonal, set of arguments seeks to explain the greater 
productivity of larger cities from the empirical observation of a dis- 
proportionate presence of economic sectors with greater value- 
added or of greater human capital 35 . Despite some important 
theoretical attempts 33,34 , what is still missing from these two lines 
of argument is an empirically based predictive formalization of urban 
diversity that ties together learning, via the specialization of know- 
ledge (accumulation of human capital), to the coordination costs 
inherent to the division of labor in cities. 

We now propose such a formalization by showing how knowledge 
specialization is set by the extent and costs of human social networks 
in cities. We implement these ideas in terms of the structural change 
of urban social networks with city size by assuming a process 
whereby individuals start as generalists in small social environments 
and are able to specialize in larger cities by externalizing some of their 
functions to others, while maintaining access to them through direct 
social links. Thus, we assume that only close coordination (at least in 
the initial stages of this transfer of functions) will be able to preserve 
these specialized functions suitably integrated and, as such, that there 
must be a conservation of the number of functions within the imme- 
diate network contacts (first neighbors) of each individual (Fig. 3B). 
This means that as an individual specializes by performing fewer 
functions, she maintains access to such functions via socioeconomic 
contacts, so that the rate of individual specialization with city size 
must equal in magnitude the rate of diversification of functions 



across ones social ties, see below. Note that this process provides a 
(reversible) mechanism for the creation of human capital, through 
the gradual division of labor in larger social networks and is fun- 
damentally different from arguments of comparative advantage 
where differential endowments of distinct individuals or organiza- 
tions are taken as given. 

We formalize these ideas in terms of the condition that, on aver- 
age, as k(N) —j^ = k(N)d(N) = A, where A is a constant in N but 

that may vary over time, e.g. due to changes in communication and 
transportation technologies. This condition is equivalent to the con- 
servation of the total rate of change of k plus d with N: Aln k + Aln d 
= 0, where A denotes a population size variation. This conservation 
also enables each individual's capacity to innovate through processes 
of functional recombination resulting from re -wirings of ties in social 
networks. The main difference of this approach to that of Ref. 30, is to 
start from the more fundamental assumption that economic produc- 
tivity is proportional to (labor) specialization that is w ~ 1/d, and not 
just to social connectivity, k and is subject to network constraints. To 
preserve physical dimensions we write w = g(kd)/d, where g is a 
function that transforms the index of specialization lid into units 
of economic productivity (money/person/unit time). Its dependence 
on kd is necessary for consistency with observations. 

With these definitions, we can formulate the problem of deter- 
mining the optimal professional diversity of a city of size N in terms 
of the d that maximizes (labor) economic productivity, w, subject to 
the constraint that activities lost to an individual remain available 
through her neighbors in their social network. Note that because d = 
A/k, by assumption, that this is equivalent to finding the value of A 
and the function g(A) that maximize productivity. This can be writ- 
ten in terms of standard constrained optimization of a target function 
£, as 

C(d;A) = ^f^-A(kd-A), (6) 
d 

where k is a Lagrange multiplier. To proceed, we take k(N) as given 
and show how social connectivity and professional diversity must be 
related in order to maximize labor productivity. The solution follows 
by taking the variations of C relative d and k to zero: 

A A 1 g(A) g(A) . 

d= k = k 0 W> fc=— fcoN, (7) 




City size (log 1Q N e ) 



Figure 3 | The scaling of economic productivity with city size and the generation of professional diversity. (A) Superlinear scaling relation between total 
wages in US metropolitan ( W(N) = W 0 N 1+S , with S = 0.18 ± 0.03, in agreement with theoretical expectations of S ~ 1/6 30 ). Inset shows the 
product A = wd versus city size, demonstrating that A it is on average independent of city size (R 2 = 0.91). (B) The process of generation of new 
occupations as a function of city size: As the self-similar frequency distribution of occupations is pushed up by city size, latent occupations cross the lowest 
probability threshold and appear explicitly in D. The schematic shows how such a new explicit function (red circle) allow other occupations (yellow and 
green circles) to specialize further (acquire brighter colors), while remaining available to each other through network ties. 
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where C is constant of integration; k x = XI k and is a function of A. 
Both C and are to be set by boundary conditions on g(A), the 
baseline for labor productivity across cities at a given time. From 
measurement ofg(A), we then obtain A, and the relationship between 
productivity, diversity and connectivity, since A = D 0 k 0 . 

These results derive the aggregate patterns of urban occupational 
diversity for all city sizes from microscopic considerations of the 
division and coordination of labor in cities and show the sense in 
which the patterns of diversity observed empirically above are 
optimal through their relation to individual productivity under net- 
work constraints. 

Discussion 

We showed that US cities express a general statistical pattern of 
professions, where greater individual specialization coexists with 
increases in overall diversity as cities become larger. We demon- 
strated that these patterns of diversity are optimal, in the sense of 
providing the basis for open-ended increases in economic produc- 
tivity per capita with city size while preserving the overall function- 
ality available to each individual. This dynamics of differentiation 
and coordination at the individual level promotes increases in the 
quality of these functions available in larger cities and thus in urban 
systems by encouraging learning and the accumulation of human 
capital, enabled by the scaling properties of urban social networks. It 
also provides a general microscopic theory, based on fundamental 
principles of economic organization 32,33 , that derives observed 
increasing returns to scale in economic productivity at the metropol- 
itan level consistent with data for thousands of cities worldwide 30 . 

These ideas for the origin of functional diversity unify important 
concepts about the role of cities as generators of heterogeneity and 
innovation. The view of cities as "social reactors" from complex 
systems theory 30 provides the underpinnings for this dynamics by 
deriving the social and infrastructural conditions (coordination 
costs) under which the process of specialization can happen and 
remain openended. They also provide a context for ideas of innova- 
tion as a recombinatory process 37 , a framework often invoked to 
explain technological development: The growth in social connectiv- 
ity with city size and the possibilities for the division of labor it 
affords, promote the appearance of explicit new functions, such as 
new professions. These functions can then be recombined in orga- 
nizations and products, which in turn may stimulate the develop- 
ment of new functions and so on. Thus, the information about more 
elaborate products and services becomes increasingly encoded in 
social networks, rather than being contained in the knowledge of 
single individuals. Given inevitable individual cognitive constraints 38 
this seems to be a necessary condition for socioeconomic complexity 
to develop as an open-ended process. 

The framework developed here also unifies two distinct points of 
view that are often thought to be at odds: Growing diversity, in this 
picture, is the emergence of greater specialization. This demonstrates 
how diversity and specialization can be two facets of the same process 
for realizing increasing productivity gains through knowledge spe- 
cialization in social organizations. In this picture, it is the transfer of 
already existing incipient functions from generalist individuals to 
growing social networks of interdependent specialists, see Fig. 3B, 
that supports their close integration and coordination. It also shows 
how system size ('the extent of the market' 32 as the metropolitan area) 
is key in enabling these dynamics as it determines, indirectly 30 , both 
productivity gains and coordination costs 33 . 

Where these two points of view - diversity and specialization - 
differ is in the unit of analysis at which they take place. Here, we 
showed that patterns of differentiation are predominately local in 
social networks and can be observed at the individual level. (The 



original division of labor in human societies - by sex and by age - 
takes place in closely-knit cohabiting groups 39 ). However, this does 
not exclude that processes of specialization and interdependence 
may also take place at higher levels of organization, such as firms, 
cities or even nations, whenever coordination costs are sufficiently 
low. In fact, a similar analysis at the establishment level (places of 
employment) reveals that there is less specialization in larger cities (a 
smaller increase in diversity) in terms of business types relative to 
professions 40 . This result implies that businesses classified as the 
same type in cities of different sizes contain, on the average, a more 
diverse set of occupations and realize greater productivity (e.g. in 
terms of value-added 41 ) in larger cities. At the national level, scale 
(population) plays less of a role in determining economic diversity or 
productivity 8 (c.f. China or India vs. the Netherlands or Singapore), 
though these two characteristics remain firmly correlated, and inti- 
mately tied to levels of urbanization. In fact, urban hierarchy theories 
from central place theory 28 to more recent economic geography 29 , 
assume the existence of a general hierarchy of 'services' common to 
all cities that grows deeper with city size. In this picture, as we have 
demonstrated quantitatively here, the economic diversity, D, of a 
national economy is set by that of its largest city. 

This evidence suggests that the development of fundamental the- 
ory linking diversity, productivity and scale in complex systems must 
be guided by the integrated analysis of microscopic productivity 
gains obtained via specialization and learning with associated coor- 
dination costs mediated by (social) networks of interdependence 33 . 
They also show how resilience deriving from greater diversity may be 
an unintended consequence of a general process of knowledge spe- 
cialization in complex networks. Similar analysis based on different 
socioeconomic units (such as firms, cities or nations) may shed light 
on the necessary conditions for these entities to realize analogous 
patterns of growth and development and help establish a more gen- 
eral quantitative framework for this sort of analysis in other complex 
systems. In this respect, it is likely an important clue to theory that 
similar quantitative patterns of functional diversity characterize the 
technological complexity of at least some simpler (pre-urban) 
human societies 42 and may be a property of other networked systems 
that can experience increases in their productivity with scale. The 
reversibility of these processes, e.g. the absence of hysteresis in the 
externalization and reabsorption of functions by individuals inter- 
connected by networks may also underlie the resilience of many 
complex systems 5 ' 20,43 under functional change or population loss. 

Methods 

Urban Units of Analysis. We adopt the standard definition of functional cities as 
metropolitan statistical areas (MSAs). MSAs are collections of political units 
(counties) aggregated by the US Census Bureau based on a set of criteria that includes 
population size, density and commuting flows. There were 403 MSAs in 2010, 
providing an ample basis for studying occupational diversity across city size (from 
50K to 20M inhabitants) as well as other urban characteristics. MSAs are integrated 
labor markets and the best current official definition of functional cities in terms of a 
mixing population 30 . 

Data Sources. Data on professional occupations in US MSAs used is the 
Occupational Employments Statistics (OES) obtained from the Bureau of Labor 
Statistics (BLS) 44 and is freely available online (http://www.bls.gov/oes/). Because The 
OES survey is a semi-annual mail survey of non-farm establishments it does not 
include self- employment. All occupations reported by employers in the US economy 
are hierarchically classified on the basis of their similarities at different levels of 
aggregation based on the BLS's Standard Occupational Classification Scheme, which 
contains a total of 840 distinct professions at its finest (r=6-digit) level of resolution 
for 2010. It is important to notice that while this classification scheme is the finest 
occupation classification available at the MSA level, it does not actually represents a 
job that someone does, but instead a group of job titles preformed on average by 7 
specific jobs. Such even finer classification is only available at the national level 
(http://www.bls.gOv/soc/#match). However, the inclusion of such definition do not 
alter the conclusions reached here as it only multiplies the diversity of occupation by a 
factor of 7 on average (See Supplementary Fig. S.4 and Table S.3). 

Fit methodology. For each year, we estimated the parameters in Eq. 1 at r =6, 5, 4, 
and 3-digits level occupations (Fig. 1A-B) through ordinary least squares, using the 
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Gauss-Newton method, which relies on linear approximations to the nonlinear mean 
function 45 . We then used y, d 0 , N 0 estimated for 2010 to fit Eq. 2. For visual purposes, a 
constant i 0 = 3 is introduced to Eq. 2 to account for the initial curvature observed for 
most common occupations (Fig. 2B). Hence the fit in Fig. 2B corresponds to 

f(i\ N) = NN~ 1 [(do - i - i 0 )(i + i 0 ) ~ l ] 1/7 . Note that i 0 is not determined by the 
process of analytic continuation, valid only at high ranks, used to obtain the form of 
the frequency distribution and constitutes from that point of view a functional 
freedom that can be set by the boundary condition of observing/(z; N) for sufficiently 
small cities, where D(N) is of order i 0 . 
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